ABSTRACT

The purposes of this research were to develop a new 
methodology for measuring comprehension and analyze individual 
differences in perceiving sentence difficulty. Written descriptions 
of commonly occurring visual scenes served as the stimuli in the new 
methodology. The reader's success in drawing a scene after viewing a 
written description of it served as a measure of his comprehension. 
Material was presented sequentially in a visual spatial mode or as a 
verbal description. The locations of scene objects were presented 
either in a random or systematic order, spatially proximal stimuli 
being presented contiguously in the systematic ordering. The results 
indicated that spatial presentation is better than verbal, and that 
systematic is substantially better than random. In addition, moderate 
but consistent relationships between task performance and reading 
ability were found. Two studies were conducted to explore individual 
differences in perceiving sentence complexity. Multidimensional 
scaling and other more traditional analyses showed a marked 
difference between good and poor readers in their perception of a 
variety of sentence characteristics. (Author/WR) 

ABSTRACT 

The research can be divided into two categories: (a) 
development of a new methodology for measuring comprehension, 
and (b) analysis of individual differences in perceiving 
sentence difficulty. 

Written descriptions of commonly occurring visual scenes 
serve as the stimuli in the new methodology. The reader's 
success in drawing a scene after viewing a written description 
of it serves as an objective measure of his comprehension. In 
an experiment with the scenes, we have presented the material 
sequentially in a visual-spatial mode or as a verbal description. 
The locations of scene objects were presented either in a random 
or systematic order, spatially proximal stimuli being presented 
contiguously in the systematic ordering. The results indicate 
that spatial presentation is better than verbal, and that 
systematic is substantially better than random. In addition, we 
have found moderate but consistent relationships between task 
performance and reading ability. Plans for extending this 
methodology are discussed. 

In the second category of research, two studies were run 
to explore individual differences in perceiving sentence 
complexity. Multidimensional scaling and other more traditional 
analyses showed a marked difference between "good" and "poor" 
readers in their perception of a variety of sentence 
characteristics (for example, syntactic complexity, familiarity, 
and meaningfulness). 







Final Report



Project No. 1F055 
Contract No. OEC-6-71-0528-(509)



Analysis of the Reading Comprehension 
Process: The Development and Utilization of an 
Assessment Technique and the Preliminary Exploration of 
Individual Differences in Perceiving Syntactic Complexities 



Donald F. Dansereau 
Texas Christian University 
Fort Worth, Texas 76129 
June, 1972 



The research reported herein was performed pursuant to a 
contract with the Office of Education, U.S. Department of 
Health, Education, and Welfare. Contractors undertaking 
such projects under Government sponsorship are encouraged 
to express freely their professional judgment in the conduct 
of the project. Points of view or opinions stated 
do not, therefore, necessarily represent official Office 
of Education position or policy. 



U.S. DEPARTMENT OF 
HEALTH, EDUCATION, AND WELFARE 

Office of Education 

National Center for Educational Research and Development 




Table of Contents 

List of Tables 

List of Figures 

I. Overview 

II. Research with the "Scene" Methodology 

    A. Introduction 

    B. Method 

    C. Results 

    D. Discussion 

    E. Conclusions and Planned Extensions 

III. Research Assessing Individual Differences 
     in Perceiving Sentence Difficulty 

    A. Introduction 

    B. Experiment 1: Perception of 
       Syntactic Structure 

    C. Experiment 2: The Effect of Reading 
       Ability and Exposure Time on Judgment 
       of Complexity, Familiarity, and 
       Comprehensibility 

IV. References 







LIST OF TABLES 

Table number   Title 

1    ANOVA Summaries 

2    Mean Comprehensibility Judgments 
     for Variations in Meaningfulness 

3    Correlations of Comprehensibility 
     Judgments with Sentence Difficulty 

4    Percentages of Significant 
     Consistency Correlations 

5    Judgment Intercorrelations 






LIST OF FIGURES 

Figure number   Title 

1    Street Scene 

2    Pier Scene 

3    Hierarchical Scene 

4    Clifton-Odom Transformation Results 

5    Weighting of the Passive Dimension 
     vs. Weighting of the Question Dimension 

6    Weighting of the Paraphrase Dimension 
     vs. Weighting of the Question Dimension 

7    Weighting of the Negative Dimension 
     vs. Weighting of the Question Dimension 

8    Weighting of the Paraphrase Dimension 
     vs. Weighting of the Negative Dimension 

9    Weighting of the Passive Dimension 
     vs. Weighting of the Negative Dimension 

10   Weighting of the Paraphrase Dimension 
     vs. Weighting of the Passive Dimension 

11   Mean Comprehensibility Judgments as a 
     Function of Sentence Difficulty for Time 1 

12   Mean Comprehensibility Judgments as a 
     Function of Sentence Difficulty for Time 2 



I. Overview 




The activities accomplished under OE Grant No. OEC-6-71-0528-(509) 
can be divided into two general categories: (a) assessment 
and utilization of a new methodology for measuring 
comprehension and (b) empirical analysis of individual differences 
in perceiving sentence difficulty. In addition to these activities, 
we also have continued to collect and annotate articles related 
to readability, reading comprehension, and psycholinguistics. 
To date, 250 references have been accumulated, 150 of these have 
been annotated, and 150 have been reproduced for future usage. 

The first category of work, which was the primary aim of 
the initial proposal, received the bulk of our efforts. Analysis 
of a second assessment methodology, utilizing multidimensional 
scaling, was proposed, but work on this approach was discontinued 
due to unforeseen difficulties in its implementation (see the 
January 15, 1972 Progress Report for details). We have just 
recently discovered ways to circumvent the problems with this 
latter technique and have included it in a recent proposal to 
extend our present work (see Section II E 2 of this document). 

The second category, dealing with perceived sentence difficulty, 
was initiated to explore an area which, as a result of our review 
of the literature, we felt was critical to understanding the 
comprehension process. Also, it appeared that multidimensional 
scaling would have much stronger potential in this area. 

We have three major studies to discuss in this report. 
A number of smaller exploratory studies have been completed, 
but their import is limited. Their main purpose was to provide 
information relevant to optimizing the designs of the major 
studies, so they will not be explicitly dealt with in this 
report. For clarity, the three studies will be subdivided 
into the two major categories. 

II. Research with the "Scene" Methodology 
A. Introduction 

Bormuth (1970) in extrapolating from his work on middle-class 
suburban schools has estimated that 515 of the students graduating 
from high school are illiterate with respect to the average 
materials used in their instruction and 46% are illiterate with 
respect to the average passage from the news media. Even though 
the assessment techniques used by Bormuth may be questionable 
(see the discussion of the cloze technique in a subsequent 
paragraph), his estimates would be a basis of concern even if 
they were substantially discounted. 

In discussing his findings on literacy, Bormuth (1970) 
has pointed out the need for a model of comprehension. "In 
order to develop a reasonably effective theory of (comprehension) 
instruction, it is necessary to have fairly accurate descriptions 
of how those (comprehension) processes link together to produce 
a complex behavior. Clearly, any effort to build such a theory 
would have to draw upon linguistic theory of the structure of 
language and upon psychological knowledge of how to analyze 
complex mental processes." Delineation of the underlying 
comprehension processes and their interactions will require 
the development of adequate techniques for the manipulation of 
variables hypothesized to influence comprehension and the 
subsequent measurement of the magnitude of their effects. 

As will be seen in subsequent sections, the methodologies 
developed to explore the comprehension processes have not been 
adequate. This research represents a step toward developing 
and examining a methodology which we believe will eventually 
remedy this situation. 

(1) Presently used Methodologies for Assessing Comprehension 

To set the stage for description of this methodology, 
we will briefly discuss the shortcomings of presently used 
techniques in the context of a particular conceptualization 
of the comprehension process. 

(a) A description of the comprehension process and 
its relationship to retention. 

It is necessary to distinguish comprehension 
from transcription (typing and reading aloud) and rote 
memorization; in both of these cases there is a one-to-one 
relationship between the input and the subsequent output. 

Although the ability to transcribe is necessary for 
successful comprehension, the critical process in comprehension 
involves the translation of the stimulus material into an 
appropriate internal representation; abstraction of theme or 
gist falls into this category. It is proposed that the essence 
of comprehension is the abstraction of relationships. These 
may be relationships between words or combinations of words in 
a sentence, words or combinations occurring in different sentences, 
as well as higher order relationships between relationships 
derived at lower levels. For example, a relationship between 
Ann, dog, and Tom may be abstracted from the sentence "Tom's 
dog bit Ann," and this relationship may, in turn, be related to 
the one derived from a later sentence: "Ann hit Tom with her 
wounded hand." The nature of these relationships cannot be 
specified at present, but it appears that they are multidimensional 
and may be adequately represented in a spatial memory model. Stored 
hierarchies of such relationships (that is, compound relationships 
between progressively lower order relationships), along with 
those relationships linking presently read material to that which 
was previously stored (that is, the integration of new knowledge 
with old through the development of new relations), would 
represent the comprehension of a passage. It should be noted 
that this description of comprehension differs only slightly 
from that of Frase (1969a, 1969b) and is presented as a framework 
in which to view previous and future research and not as 
a serious model. 

Although many researchers have called for methodologies 
which analyze comprehension independently of memory processes 
(for example, Carroll, 1971), it is clear that storage of 
the abstracted relationships both during and after the comprehension 
process is critical to successful performance in an educational 
setting. In fact, some sets of relationships between concepts 
presented in a passage may be easily abstracted, but due to any 
number of variables (for example, lack of concreteness, irrelevancy, 
disorganization) may subsequently be retained for only a minimum 
length of time. In addition, since during comprehension some 
effort to store the abstracted material presumably takes place, 
there is a type of time sharing which may be reflected in 
retention difficulties. For example, a complex passage (due to 
unfamiliarity, syntactic and organizational complexity, etc.) 
may require substantial processing (both syntactic and semantic) 
in order to provide accurate comprehension. This increased 
effort may reduce the amount of processing devoted to storing 
the abstracted relationships. With simpler passages a more 
equitable balance between understanding and storage processes 
may be maintained. Therefore, differences between complex 
and simple passages may not appear under immediate recall 
conditions, but may be reflected only in retention difficulties 
over time. Naturally, an appropriate methodology will allow 
for the analysis of both comprehension and retention processes 
as well as their interactions. 

(b) Shortcomings of presently used methodologies 
for exploring comprehension and retention. 

Although there have been a variety of methods 
for measuring comprehension and diagnosing processing difficulties, 
there have been only a few classes of techniques which have been 
applied to multiple sentence connected discourse. 

(i) Free recall. This technique, which requires 
the S to reproduce the essence of the presented material in his 
own words, has probably the greatest face validity of any of 
the methods. The major difficulty is one of scoring the S's 
output in an efficient, economical fashion. To date, this 
problem still remains unsolved. Crothers (1971) is developing 
a procedure for measuring paragraph structures which should 
eventually be applicable to comparing paragraph input with 
Ss' outputs. Until this approach is perfected, the free 
recall technique remains very difficult to use in both testing 
and experimental situations. 

(ii) Multiple choice tests. The most common 
technique used in previous research on comprehension, and in 
practical testing situations, has been the multiple choice 
test. This approach has been used in both comprehension and 
readability studies. Comprehension has been studied by factor 
analyzing the responses to questions which purportedly measure 
different components of comprehension ability. For example, 
a typical set of questions to be administered following the 
reading of a passage would be designed to test: word knowledge, 
ability to manipulate ideas and concepts in relation to one 
another, ability to grasp the author's expressed ideas, ability 
to identify the writer's intent or purpose, ability to follow 
the organization of a passage, and knowledge of literary devices 
and techniques (Davis, 1944). 

The results of a number of analyses of this 
type (Davis, 1944; Conant, 1942; Thurstone, 1946; Hall and 
Robinson, 1945; Stoker and Kropp, 1960; Davis, 1968; Singer, 
196?) have been equivocal. It is unclear from this research 
whether there is a single general factor underlying comprehension 
or multiple factors. One major difficulty with this research, 
which may to some degree account for the ambiguous results, is 
the apparent lack of valid and replicable comprehension test 
questions. In effect, different researchers do not necessarily 
agree on what factors specific questions do measure, nor do 
they necessarily agree on the level of difficulty exhibited 
by each question (see Lorge, 1939; Bormuth, 1966; and Bormuth, 
1970). This latter problem, concerning the difficulty of 
the test questions, also casts doubt upon the validity of the 
readability findings. 

(iii) The cloze technique. The cloze technique 
developed by Taylor in 1953 has been used extensively in 
analyzing the readability of passages. This method generally 
involves the replacement of every fifth word in a passage with 
an underlined blank. Subjects, who have not previously read 
the passage, are asked to write in the words they think were 
deleted, and their responses are scored correct when they 
exactly match the deleted words. A number of studies (Taylor, 
1953; Bormuth, 1967; and Rankin, 1965) have shown that the 
cloze technique is a valid and reliable measure of comprehension 
difficulty when responses to post-reading questions serve as 
criteria. For example, Bormuth (1962) found a .77 to .07 
correlation range at the fourth-grade level when individual 
cloze and multiple choice comprehension test scores were 
correlated over the same material. 
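
The deletion and scoring rules just described can be illustrated with a short sketch. This is a minimal illustration under stated assumptions, not Taylor's or Bormuth's actual procedure: the function names `make_cloze` and `score_cloze` are hypothetical, words are taken to be whitespace-separated tokens, punctuation attached to a deleted word is left in place, and "exact match" is taken literally (case-sensitive).

```python
import string


def make_cloze(passage, nth=5):
    """Replace every nth word with a blank; return (test_text, answer_key)."""
    words = passage.split()
    answers = []
    for i in range(nth - 1, len(words), nth):
        # Strip surrounding punctuation so only the word itself is deleted.
        core = words[i].strip(string.punctuation)
        if not core:
            continue  # token was pure punctuation; nothing to delete
        answers.append(core)
        words[i] = words[i].replace(core, "_____", 1)
    return " ".join(words), answers


def score_cloze(responses, answers):
    """Count responses that exactly match the deleted words."""
    return sum(
        1 for given, key in zip(responses, answers) if given.strip() == key
    )


if __name__ == "__main__":
    passage = (
        "The quick brown fox jumps over the lazy dog "
        "near the quiet river bank today"
    )
    test_text, key = make_cloze(passage)
    print(test_text)
    print(score_cloze(["jumps", "by", "today"], key), "of", len(key))
```

Because only exact restorations are counted, a synonym such as "by" for "near" scores zero, which is the strict scoring rule the text describes.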

This technique has been used in lieu of 
post-reading questioning to determine the effects on reading 
difficulty of quantitative and qualitative differences in 
linguistic properties at the level of words, clauses, sentences, 
and longer passage segments (Bormuth, 1966; Ruddell, 1965; 
MacGinitie and Tretiak, 1969; Bormuth et al., 1970). A large 
number of these linguistic properties (for example, word length, 
sentence length, and number of clauses within a sentence) have 
high and predictable correlations with reading difficulty as 
measured by the cloze technique. Hopefully, knowledge of the 
linguistic factors which facilitate or inhibit comprehension 
will allow us to construct graded reading material, diagnose 
reading difficulties, and infer processes underlying comprehension. 
The cloze technique, although objective, may be misleading in that 
it is difficult to determine exactly what is being measured. 
Validation of this technique, as discussed previously, has been 
accomplished by comparing its results with those of multiple 
choice test questions. If the validity of the test question 
approach is in doubt, as users of the cloze technique have 
themselves pointed out, then the validity of the cloze technique 
is also in doubt. 



Even if it is assumed that this technique 
is a valid indication of comprehension difficulty at some level, 
its usefulness would appear to be limited to the abstraction of 
relationships which are in close proximity within the passage. 
In fact, MacGinitie (1961) found that cloze items are statistically 
independent when surrounded by five words of context; that is, 
uninterrupted context more than five words on either side of the 
target did not help in the restoration of the missing word. This 
would imply that the technique is measuring difficulty over small 
segments of text, and would be insensitive to the abstraction of 
higher order relationships, such as those concerned with paragraph 
organization. In addition, the typical usage of the cloze 
technique would preclude the analysis of the retention aspects 
of comprehension. 

(2) Description of the Scene Methodology 

As an initial step toward producing a more efficient 
methodology for analyzing and testing comprehension and retention 
processes, common, aerial-view scenes were developed (Figures 1 
and 2). One purpose of using physically realizable concepts was 
to provide an objective, alternate format in which a reader can 
express his comprehension of a passage. For example, a reader 
of a set of instructions for constructing a crib can show his 
comprehension of those instructions by answering questions about 
the material or by actually attempting to build the object. 
Although motor skills may confound the results, this latter 
procedure should provide an objective test of the individual's 
comprehension of the passage or set of passages. In a like 
fashion, descriptions of movement, objects, visual scenes, etc., 
should contribute reasonable alternative measures of comprehension. 

A second advantage to using physically realizable concepts, 
especially scenes, is that the basic information (deep structure) 
can be presented in alternative ways, either as a verbal 
description or as a picture. These alternative modes of presentation 
allow the experimenter to separate language decoding aspects from 
other comprehension processes. 

Previous work with physically realizable concepts has been 
very limited. A number of studies have used the ability of Ss to 
follow directions as a measure of comprehension (Jones, 1966; 
Brown, 1955; Shipley, Smith, and Gleitman, 1967). These studies 
have been limited to using only a few sentences and are hampered 
by the lack of an alternative mode of presentation. 

Other studies have tested comprehension by requiring 
verification against pictured referents (Gough, 1965, 1966; Slobin, 
1966). This procedure is most closely aligned with the "scene" 
methodology, with three exceptions: (a) picture verification has 
been applied only with single sentences, (b) the verification 
process allows for guessing, which impedes the validity of any 
single response, and (c) presentation in alternative modes (verbal 
vs. visual) has not been explored. The "scene" methodology 
represents an extension of these studies on all three dimensions. 

To date, five scenes have been constructed; these consist of 
aerial views of an airport, pier, street intersection, farm, and 
grocery store, plus a modified living room scene for practice 
purposes (examples are shown in Figures 1 and 2). These scenes, 
which consist of seven objects on a three item background, have 
been equated in a variety of ways, thus permitting generalization 
through replication. In addition to the scenes themselves, we have 
developed standard verbal descriptions of the scene objects and 
their locations. The number of words, the number of sentences, and 
the sentence structure have been equated over all descriptions. 

(3) Preliminary Assessment of the "Scene" Methodology 

The philosophy underlying the present research is 
that a methodology can be most efficiently tested, even at 
the initial stages, by using it in an experiment designed 
to determine the empirical relationships between meaningful 
variables. With such an approach, there are, in addition to 
the resulting methodological developments, potentially strong 
empirical consequences which may lead to new implications for 
a theory of comprehension. 

The specific independent variables investigated in this 
research and their operational definitions are as follows: 

(1) Syntactic complexity: This is operationally 
defined as the voice of the sentence or passage. Greenough 
and Semmel (1969) suggested, as have others, that active 
sentences, because they are most frequently encountered, are 
easier to comprehend and recall than passive sentences. 

(2) Order of presentation: The locations of objects 
in a visual scene can be described either in an order based on 
a logical progression through the scene using spatial proximity 
as the criterion, or in a random order. It was reasoned that 
the logical, systematic order should facilitate comprehension 
and recall. 

(3) Mode of presentation: The locations of the 
objects can be conveyed using either written descriptions or 
diagrams. If reading difficulties have to do with the ability 
to translate information from a written code into a visual code 
(in the case of descriptions of visual scenes), then good and 
poor readers should do equally well on the diagrammatically 
presented information, but not with the written descriptions. 

(4) Reading ability: This variable was defined in 
terms of the total reading score on the Cooperative English 
Reading Test, which is administered to all freshmen entering 
Texas Christian University. 

B. Method 



Eighty students, fulfilling an Introductory Psychology 
course requirement at Texas Christian University, served as 
Ss for a total of one hour per individual. The forty Ss with 
the lowest "total reading" percentile on the Cooperative 
English Reading Test were arbitrarily assigned to the poor 
reading group. The remaining forty Ss were designated as members 
of the good reading group. The range of scores in the poor reading 
group was from the 7th percentile to the 68th percentile and 
the range for the good readers was from the 75th to the 99th 
percentile. Good and poor readers were randomly assigned to 
four experimental treatment groups such that each group was 
composed of ten good and ten poor readers. In addition, Ss 
were assigned to groups with the constraint that the mean 
total reading percentiles for the four groups were approximately 
equal. 



Stimulus Material 



Aerial views of six familiar scenes (a city intersection, 
a farm, a pier, an airport, a living room, and a grocery store) 
presented either in the diagrammatic format illustrated in 
Figures 1 and 2 or in the form of written descriptions comprise 
the stimulus set from which the stimuli used in the various 
phases of the experiment were selected. Each scene consists 
of a background which is standard for all of the scenes except 
for labeling, plus seven objects superimposed on that background. 
For each scene, three components of the background are labeled. 

Procedure 



Each S, regardless of experimental treatment group assignment, 
was presented with four scenes (one of which was for practice) 
via black and white slides in the following manner. First the 
background with three labeled components was presented (either 
in diagrammatic or written format) for 24 seconds. For the 
written format this corresponds to a reading rate of 150 words 
per minute. Next, each of the seven scene objects was presented 
for eight seconds in conjunction with the background. One minute 
after presentation of the last object the Ss were asked to list 
the seven objects and the three labeled background components in 
the order in which they were presented. Upon completion of this 
task they were asked to locate and label the seven objects and 
label the three background components on an unlabeled background. 

The dependent measures corresponding to these two tasks are total 
items correctly recalled, and total items correctly located and 
labeled. 
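
The presentation times given above can be checked with a short calculation. The sketch below is illustrative only: the helper name `words_at_rate` is hypothetical, and the 20-word figure for an eight-second object slide is an inference that holds only if the same 150 words-per-minute rate applied to the object sentences, which the report does not state.

```python
def words_at_rate(seconds, words_per_minute):
    """Number of words read in `seconds` at a given reading rate."""
    return words_per_minute * seconds / 60


# Background slide: 24 seconds at the stated rate of 150 words per minute.
background_words = words_at_rate(24, 150)

# Assumption (not stated in the report): the same rate applied to each
# eight-second object slide.
object_words = words_at_rate(8, 150)

# Total exposure per scene: one background slide plus seven object slides.
total_exposure_seconds = 24 + 7 * 8

if __name__ == "__main__":
    print(background_words, object_words, total_exposure_seconds)
```

Under these figures the written background description would run to about 60 words, and each scene would be on screen for 80 seconds before the one-minute retention interval.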

The four experimental groups, each containing good and poor 
readers, represent the eight possible cells of a fully crossed 
2x2x2 design with reading ability, mode of presentation 
(diagrammatic or written), and order of presentation (systematic 
or random) as factors. In order to investigate the effect of 
syntactic complexity upon retention, the written groups, both 
systematic and unsystematic, were presented with descriptions of 
the scenes such that every other sentence (each sentence 
corresponding to the location of one object on the background) 
was in the active voice and the remaining sentences were in the 
passive voice. Thus, for the written groups only, an additional 
within-subjects factor was introduced. 
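
The layout of this design can be made concrete by enumerating its cells. The following is a minimal sketch under the figures given in the Method section (eighty Ss, four treatment groups of ten good and ten poor readers); the variable names are illustrative and not taken from the report.

```python
from itertools import product

READING = ("good", "poor")
MODE = ("diagrammatic", "written")
ORDER = ("systematic", "random")

# The four between-subject treatment groups cross mode with order.
treatment_groups = list(product(MODE, ORDER))

# Splitting each treatment group by reading ability gives the eight cells.
cells = list(product(READING, MODE, ORDER))

N_SUBJECTS = 80
subjects_per_cell = N_SUBJECTS // len(cells)

if __name__ == "__main__":
    for reading, mode, order in cells:
        print(f"{reading:5s} {mode:13s} {order:10s} n={subjects_per_cell}")
```

Each of the eight cells therefore holds ten Ss, and voice (active vs. passive) is an additional within-subjects factor only in the four cells where the mode is written.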



In addition to this first task, which will be termed 
"Phase I" of the experiment, a second task termed "Phase II" 
was required of the Ss in the written groups. These Ss, 
upon completion of "Phase I," were given a set of 3x5 cards, 
each containing one sentence. The set of cards, taken 
together, described two additional scenes, with order of 
presentation again being either systematic or random. For 
each S, one of the two scenes was presented entirely in the 
active voice and the second entirely in the passive voice. 
Ss were provided with unlabeled backgrounds and told to read 
the cards and, after reading each card, to locate and label 
(by initial letter only) the appropriate scene object on the 
blank background as quickly as possible. Total time to locate 
and label the elements of each scene was the dependent measure. 

The second phase of this experiment can be conceptualized 
as a 2x2x2 fully crossed design with syntactic complexity 
(active vs. passive voice) as a within-S factor. Syntactic 
complexity and the two scenes were appropriately counterbalanced 
to insure that the two would not be confounded. 
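
The report does not spell out the counterbalancing scheme. One conventional way to keep voice and scene unconfounded is sketched below, purely as an assumption: the scene labels `scene_A` and `scene_B` and the function names are hypothetical, and the four conditions cross which scene is in the active voice with which scene is presented first.

```python
from itertools import product

SCENES = ("scene_A", "scene_B")  # hypothetical labels for the two scenes


def other(scene):
    """Return the scene that is not `scene`."""
    return SCENES[1] if scene == SCENES[0] else SCENES[0]


def counterbalance_conditions():
    """Cross 'which scene is active' with 'which scene comes first'."""
    conditions = []
    for active_scene, first_scene in product(SCENES, SCENES):
        presentation = [first_scene, other(first_scene)]
        conditions.append(
            [
                (scene, "active" if scene == active_scene else "passive")
                for scene in presentation
            ]
        )
    return conditions


def assign(subject_ids):
    """Rotate subjects through the four conditions in turn."""
    conditions = counterbalance_conditions()
    return {
        sid: conditions[i % len(conditions)]
        for i, sid in enumerate(subject_ids)
    }


if __name__ == "__main__":
    for sid, plan in assign(range(1, 9)).items():
        print(sid, plan)
```

With subject counts that are multiples of four, each scene appears equally often in each voice and each voice appears equally often in first position, so a scene effect cannot masquerade as a voice effect.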

C. Results 

Phase I 

2x2x2 fixed effects analyses of variance done on the two 
dependent measures with reading level, mode of presentation, 
and order of presentation as factors indicated the following. 

(1) The diagrammatic presentation mode resulted in 
significantly better recall than did the written mode (p < .01). 

(2) The systematic presentation order resulted in 
significantly better recall than did the random order (p < .05). 

(3) Mean items recalled and mean items located and 
labeled were consistently greater for good than poor readers in 
comparable treatment conditions, but even so, this factor as well 
as all interactions were non-significant. 

2x2x2 analyses of variance done on the two dependent measures 
for the written group, with presentation order, reading level, 
and syntactic complexity as factors, indicated that only the 
presentation order variable was significant, with systematic 
leading to better recall than random (p < .05). Again, good-poor 
differences, though not significant, were in the expected 
direction. 

A correlation of .50 was found between each of the two 
dependent measures in Phase I and the total reading percentile 
on the Cooperative English Reading Test. This suggests a weak 
but significant relationship between these measures. 
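
The size of this relationship can be put in perspective with a short calculation. The sketch below is a worked illustration, not the authors' analysis: the function name `r_to_t` is hypothetical, and the sample size of 80 is an assumption (the report does not state how many Ss entered the correlation). It uses the standard conversion t = r * sqrt(n - 2) / sqrt(1 - r^2).

```python
import math


def r_to_t(r, n):
    """t statistic for testing a Pearson r against zero, with n - 2 df."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


r = 0.50
n = 80  # assumption: all eighty Ss were included in the correlation

shared_variance = r ** 2  # proportion of variance the two measures share
t_value = r_to_t(r, n)

if __name__ == "__main__":
    print(f"r^2 = {shared_variance:.2f}, t({n - 2}) = {t_value:.2f}")
```

A correlation of .50 corresponds to 25 percent shared variance, and under the assumed sample size the t statistic is a little above 5, which is why the relationship can be modest in size and still statistically reliable.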



Phase II 



A similar 2x2x2 analysis of variance done on the same 
three factors in Phase II revealed no significant effects at 
all. Again, good-poor differences were consistent and in the 
predicted direction. 

D. Discussion 

The lack of any significant effect involving the reading 
level variable (good vs. poor) was surprising. The failure of 
the consistent differences between good and poor readers to 
reach statistical significance may be due to: (1) lack 
of sensitivity of the dependent measures and/or (2) lack of 
consequential differences in reading ability between the good 
and poor readers. It was felt that correlational analysis 
would provide more detailed information on the relation 
between reading scores and retention of the scene information. 
The small but significant correlation of 0.50 suggests that 
reading test scores and performance on the experimental task 
are to some extent measuring the same underlying ability. Only 
further work with this methodology, suitably extended so that 
the dependent measures are appropriately sensitive, will properly 
reflect the magnitude of this relationship. 

The nonsignificance of the syntactic complexity variable 
could also be due to the insensitivity of the dependent 
measures, since, as we have already stated, differences between 
the retention of active and passive sentences have been 
demonstrated in previous studies. It was hoped that syntactic 
complexity would interact with reading test scores, showing a 
greater disparity in performance between good and poor scorers 
with the more complex passive sentences than with the active. 
Unfortunately, it appears that the dependent measures were not 
sufficiently sensitive to record such an interaction. Further 
work with more refined measures will aid in determining the 
effects of this variable. The second category of studies 
accomplished during the grant period bears on this issue (see 
Section III). 

The significant order effect was consistent with the
findings of Oakan, Wiener, and Cromer (1971); however, the
lack of any interaction of this variable with reading level
is inconsistent with their hypothesis that poor readers,
because of organizational disability, are less able to comprehend
information that is not already well organized. It should be
noted, however, that the insensitivity of the dependent measures
in this research may well be masking this interaction as well as
other expected effects.

The mode of presentation effect was surprising in that
it occurred with both dependent measures - recall of scene
items and locating and labeling. While it is obvious that a
visual presentation is a more direct method of conveying the
spatial information, it was not anticipated that it would so
greatly facilitate the simple recall of the item names. In
retrospect, this can be explained in one of two ways. First,
if the visual presentation facilitates a non-verbal storage
of the object locations, then more verbal storage would be
free to store the item names. Second, if the visual presentation
conveys the spatial aspects of the scenes more quickly, then
the Ss would have more time in which to rehearse the item
names.



E. Conclusions and Planned Extensions 

At this point, we are forced to conclude that the utility
of the "scene" methodology as a reading assessment technique
is somewhat limited. It appears that the task, as presently
structured, is not powerful enough to detect subtle,
theoretically meaningful differences in performance. We feel
that we have found a solution to this lack of sensitivity and
will discuss it in sub-section (2). Prior to this discussion,
however, we would like to describe some plans for future work
in an area in which we feel the presently developed scenes
will be extremely useful.

(1) Temporal Organization (Order of Presentation)

In many practical cases, there is a pressing need to
present information in a way which will maximize performance
in tasks requiring comprehension, retention, or utilization.
In pursuit of this objective, much research has focused on the
effects of order of presentation (or temporal organization).
Although there has been a large amount of research on this
variable, especially in the domain of simple retention studies,
the results have not been integrated sufficiently, nor have the
research methodologies been powerful enough, to permit
substantial generalization and application.

Our work in pattern recognition, human memory, and
reading comprehension has required us to look very carefully
at the effects of stimulus ordering in these tasks. As a
result of this research and of a preliminary review of the
literature on stimulus ordering in a broad range of tasks, we
have delimited an area of study which we feel now deserves
substantial attention. The area lies between simple retention
studies (serial list learning, free recall, and recognition
memory) and the more complex tasks of meaningful prose
comprehension and retention. Previous research on stimulus
ordering has focused heavily on simple retention studies; a good
deal is known about the serial position effect, isolation
effects, and conceptual clustering as they occur in these tasks.
Furthermore, a number of sophisticated models, some quite
successful, have been developed to account for these phenomena.

Once we move out of this area, however, to tasks
involving more complex underlying stimulus organizations (for
example, hierarchical structures) and more complex processing,
the sophistication of the work decreases. In particular, this
decrease is quite marked in the transition from simple retention
to prose comprehension and storage. It appears that two kinds
of activity are needed in order to extend our knowledge of
temporal organization to more complex tasks. First, it is
necessary to specify the underlying stimulus organizational
structure which must be sufficiently transmitted in order to
produce effective performance in these tasks. This
organizational structure will generally consist of a set of
items and their interrelationships associated with a particular
content area (for example, biological taxonomy). Structures can
take the form of lists, hierarchies, and networks, plus a variety
of higher order combinations of these simple structures. The
ultimate requirement of this phase is a taxonomy of organizational
structures within which a wide variety of tasks and content
areas can be categorized.

In the second kind of activity, each organizational 
structure must be studied separately. The effects of varying
the temporal organization of the information would be assessed 
and the results integrated into a model of human processing. 

Since the "scene" stimuli have a two-dimensional
network structure which is more complex than the list structures
normally used in simple retention studies, they provide a good
basis for initiating this second type of activity. As stated
earlier, the results from our "scene" experiment indicate that
spatial presentation is substantially better than verbal, and
that systematic is far better than random regardless of the mode
of presentation. These results raise a number of questions
which we have proposed to explore (a proposal has been submitted
to the Office of Naval Research).

(a) Does Pythagorean proximity provide the most
efficient means of temporally organizing the material? Alternative
orderings which will be explored are based on "city block"
proximity and on presentation of various features of the Gestalt
figure, such as presenting corner items prior to middle items,
et cetera.

(b) What aspects of the human processing system
determine efficient ordering in this task? The results from
question (a) above will be analyzed with this in mind.

(c) What are the effects of expanding the number
of stimulus items and manipulating the background configuration?
The generalizability of the previous results will be tested.

(d) How pervasive are the effects of individual
differences? Other measures of individual difference will be
included in the proposed studies.

Answers to these questions will presumably give
us some indication of temporal organization effects in simple
two-dimensional networks.
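
The contrast raised in question (a) can be made concrete with
a short Python sketch. It orders the items of a scene so that
each item presented is the nearest remaining neighbor of the
previous one, under either Pythagorean or city-block distance.
The item names, the coordinates, and the greedy nearest-neighbor
rule are assumptions made for illustration; they are not the
orderings actually used in the experiment.

```python
import math

def euclidean(p, q):
    """Pythagorean (straight-line) distance between two points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def city_block(p, q):
    """City-block distance: sum of absolute coordinate differences."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])

def proximity_order(items, start, metric):
    """Greedy ordering: present next the remaining item nearest the last one."""
    remaining = dict(items)
    order = [start]
    current = remaining.pop(start)
    while remaining:
        nearest = min(remaining, key=lambda name: metric(current, remaining[name]))
        order.append(nearest)
        current = remaining.pop(nearest)
    return order

# Hypothetical scene: names and (x, y) locations are invented.
scene = {"barn": (0, 0), "well": (3, 0), "tree": (2, 2), "gate": (7, 0)}

euclid_order = proximity_order(scene, "barn", euclidean)
block_order = proximity_order(scene, "barn", city_block)
print(euclid_order)
print(block_order)
```

With these invented coordinates the two metrics disagree about
which item should follow the first, which is the kind of
difference the proposed comparison would have to detect.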

(2) Development of Hierarchical Scenes

Hierarchical structures predominate in educational
settings; the structures of textbooks, courses, and curricula
are almost always hierarchical. In addition, although most
physically realizable concepts are not inherently hierarchical
in nature, we often find it convenient to describe them
hierarchically (for example, instructions for games and for
filling out tax returns). In the theoretical domain, investigators
have hypothesized that cognitive plans for problem solving and
action are hierarchically organized (Miller, Galanter, and
Pribram, 1960), and that semantic memory consists largely of
interlocking hierarchies (Miller, 1969; Collins and Quillian,
1969, 1970; Kintsch, 1970; Mandler, 1967).

Given the apparent importance of such structures, it
is surprising that only a few studies have been directed toward
analyzing the comprehension and/or retention of material
organized in this fashion. Bower, Clark, Lesgold, and Winzenz
(1969), Rourke (1971), and Segal (1969) used hierarchies of words
in free recall experiments and showed that the hierarchical
organization "built into" a set of words to be learned can be
recovered in the recall organization. However, none of the
studies we are familiar with has manipulated structure or
presentation variables, nor have they extended the free recall
findings to the area of prose comprehension and retention.

In order to fill this gap and to provide a more
sensitive comprehension assessment methodology, we have developed
preliminary hierarchical "scenes" (for example, see Figure
3) and have proposed to further develop and utilize these
stimuli in theoretically meaningful experiments (proposals
have been sent to Child Health and Human Development, National
Institute of Mental Health, and the National Science Foundation).

The stimuli, which will be presented both diagrammatically
and in the form of verbal descriptions, will reflect various
conditions of hierarchical organization in the information to
be presented. The scenes will be constructed to the
specifications of each individual experiment. They will be
analogous to the one shown in Figure 3, but naturally will
deviate somewhat depending on the particular experimental
manipulations employed. In addition to these aspects of the
methodology, several other properties may be noted.

The scenes will be artificial to minimize subject
differences based on previous experience and to permit
construction of alternate forms in which the stimuli are identical
except for the object names. These alternate forms will be
used to provide replications of each experimental condition.

The scenes also exhibit several other manipulable
properties. In particular, there will exist a number of
dimensions along which subjects may structure their memories;
these dimensions include object size (represented by the size
of the circle indicating its location), spatial proximity,
interconnectedness (via roads, et cetera), and object class (type
of item, such as business, recreation, et cetera). These
dimensions can be arranged so as to be correlated with the
locations of the objects in the hierarchical structure or
independent of the locations. Presumably, having a number
of dimensions correlated with the hierarchical structure will
facilitate the use of this structure in memory.

With regard to the verbal description of each scene,
the sentences will generally be in one of two forms:

"'A', which is a large town, is located immediately
to the west of the mountains and just north of the railroad
tracks"; or

"Immediately to the west of the mountains and just 
north of the railroad tracks is the large town of 'A'." 

Since we are not explicitly interested in sentence processing,
we will not attempt to manipulate variables of sentence 
structure, but will use the two forms intermixed randomly to 
provide variety throughout the passage. 
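
One way the random intermixing of the two sentence forms might
be mechanized is sketched below in Python. The function name
`describe`, its parameters, and the fixed random seed are
hypothetical conveniences for illustration; only the two
sentence templates come from the text above.

```python
import random

def describe(name, size, relations, rng):
    """Return one of the two sentence forms, chosen at random."""
    where = " and ".join(relations)
    if rng.random() < 0.5:
        # Form 1: object first, location last.
        return f"'{name}', which is a {size} town, is located {where}."
    # Form 2: location first, object last.
    return f"{where[0].upper()}{where[1:]} is the {size} town of '{name}'."

rng = random.Random(1972)  # fixed seed so a passage can be regenerated
relations = [
    "immediately to the west of the mountains",
    "just north of the railroad tracks",
]
sentence = describe("A", "large", relations, rng)
print(sentence)
```

Calling `describe` once per object, with a shared generator,
yields a passage in which the two forms alternate unpredictably.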

The conversion of a scene into verbal description is 
straightforward, although the resulting prose is hardly elegant. 
We are not entirely satisfied with our present method of conver- 
sion into verbal form and plan to explore alternative methods, 
including techniques to obtain representative samples of text 
produced by persons asked to describe the scenes. 

As with the previous, non-hierarchical scenes,
the S will locate and label the objects and the components of
the background. An unlabeled background will be provided for
this purpose. A variety of scoring procedures will be used
to assess performance on this task; these include Pythagorean
distance between objects in the original and in the response
"scenes", and absolute number of correct items. This free
recall of information should provide a direct measure of how
well Ss comprehended and retained the original material.
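
The two scoring procedures named above can be sketched in
Python as follows. The data layout (dictionaries mapping item
names to x, y locations), the function name `score_response`,
and the use of a mean placement error are assumptions made
for illustration, not a description of the scoring actually used.

```python
import math

def score_response(original, response):
    """Return (number of correct items, mean Pythagorean placement error).

    An item counts as correct if its name appears in both scenes.
    The mean error is None when no item was recalled.
    """
    recalled = [name for name in response if name in original]
    errors = [math.dist(original[name], response[name]) for name in recalled]
    mean_error = sum(errors) / len(errors) if errors else None
    return len(recalled), mean_error

# Hypothetical original scene and one subject's drawing.
original = {"A": (0, 0), "B": (4, 0), "C": (2, 3)}
response = {"A": (3, 4), "B": (4, 0), "D": (1, 1)}

n_correct, mean_error = score_response(original, response)
print(n_correct, mean_error)
```

Here item "D" is an intrusion and item "C" an omission, so only
"A" and "B" contribute to the distance score.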

Multidimensional scaling will be used to further assess
the organization of the presented material in memory. With
this technique (see Torgerson, 1958, for a more complete
description), Ss will be asked to judge the relatedness of
all possible pairs of scene objects. The resulting data will
be analyzed by the INDSCAL program (Carroll and Chang, 1970).
The output from this program will be an inferred
multidimensional representation of the Ss' memory structures.
Previous work with this technique (Dansereau, Fenker, and Evans,
1970) has shown it to be of substantial value in determining how
material is stored. In particular, important dimensions of
storage can be delimited.

In order to provide a basis for concurrent validation of
the assessment of the inferred memory structure, the S will
be asked to access the stored material along a number of
dimensions. Questions such as "name the three largest objects
in the scene" or "name the three most proximal objects" will
be asked. Latencies and errors in responding can then be used
to draw inferences as to the nature of the internal structure
and to detect changes in this structure as the result of
experimental manipulations. Findings with this approach should
be strongly related to the results of a multidimensional
scaling analysis.

III. Research Assessing Individual Differences in 
Perceiving Sentence Difficulty 

A. Introduction 

One aspect of the comprehension process which we feel
deserves attention is that of on-going decision-making. It is
hypothesized that a good reader can vary his processing mode
to suit the difficulty and importance or relevance of the
material. The basic decisions are concerned with determining
the appropriate time to shift from one processing mode to
another. Decisions to scan, concentrate, assimilate, et cetera,
if made appropriately, produce greater reading efficiency.
These decision processes have been studied only in superficial
or applied experimental contexts under the label of "reading
flexibility" (Bowman, 1966; Braam, 1963; Braam & Berger, 1968;
Laycock, 1955; McDonald, 1965). In general, very little
attention has been paid to individual differences in processing
critical to decision-making aspects of comprehension.
Two studies were specifically designed to examine the
correlates of individual differences in one critical aspect of
decision-making: the perception of sentence difficulty.

It is clear that highly difficult material will require 
more detailed processing than less complicated material. 
Therefore, accurate perception of the underlying difficulty 
of the material is very important in efficient utilization 
of the different processing modes. 

These two studies will be discussed separately below.
The first deals with individual differences in perceiving
syntactic structure. Since it has been shown that sentences
differing in structure vary with respect to processing
difficulty, it is clear that accurate perception of structure
is necessary for efficient shifting of processing modes.
The second study includes variables, in addition to syntactic
complexity, which are thought to influence sentence difficulty
(for example, word familiarity and content meaningfulness).

B. Experiment 1: Perception of Syntactic Structure

In addition to potentially causing difficulties in making
processing decisions, the inability to use grammatical structure
appropriately may have other implications. In
particular, Weinstein and Rabinovitch (1971) examined
differences in the extent to which "good" and "poor" readers were
able to take advantage of grammatical structure in learning
lists of nonsense syllables. Two groups of Ss were given the
task of committing to memory lists of nonsense syllables
which varied in the amount of grammatical structure that was
present. One kind of list was completely unstructured. A
second was in the form of a pseudo-sentence; that is, it
contained English function words and bound morphemes such that
if the nonsense syllables had been replaced by appropriate
English words, grammatical sentences would have resulted.
An example of this type of construction (with the bound
morphemes underlined) is: When they sivoled the veg, they
han ashed zafly. A third type of list was identical to the
second except for the omission of the bound morphemes.

Epstein's (1961, 1962) work demonstrated that the pseudo
sentences significantly facilitated learning and that the addition
of bound morphemes resulted in significantly faster learning
than the pseudo sentences alone. Weinstein and Rabinovitch
showed that while this is true for good readers, significant
differences in learning rate are not observed with readers
scoring at or below the 50th percentile on the Gates Reading
Test. This suggests that sensitivity to, or ability to process,
sentence structure may be a factor in determining reading
proficiency.

This finding suggests a number of possibilities, the
first of which is that persons with reading difficulties,
aside from vocabulary problems, may be relatively
insensitive to structural information. The result of this sort of
problem might be that the person would, because of time
constraints, fail to comprehend sentences which, with more
time, he could have understood.

A second and more interesting possibility is that poor
readers might incorrectly use structural cues and thus assign
a different structure and thus an incorrect meaning to certain 
sentences. In effect, this is to say that the grammar of 
the poor reader might simply not be the same as the grammar 
of his society. 

In order to explore these possibilities, a multidimensional
rank order technique was used to determine perceived proximity
between all possible pairs of a set of 16 sentences. These
sentences were 8 transformations derived from each of two
kernels (the two kernels are paraphrases of each other).
Multidimensional scaling (Carroll and Chang, 1970) was used
to derive spatial representations of the sentences for both
"good" and "poor" readers. These spaces were then compared
to determine differences related to reading ability.

(1) Method

Subjects

Thirty-one General Psychology students served as
subjects in order to fulfill the course requirement. Seventeen
of these Ss were classified as "poor" readers (total reading
percentages on the Cooperative English Reading Test ranging
from 10% to 71%); fourteen were classified as "good" readers
(total reading percentages ranging from 77% to 99%).

Stimuli

The stimuli used in this experiment were 16 English
sentences. The sentences varied from 9 to 12 words in length
(depending on the transformation) and contained high familiarity
words (selected from the set of "A" words in Thorndike and Lorge,
1944). Of the sixteen sentences, 8 were from one kernel (K),
passive (P), question (Q), negative (N), PN, PQ, NQ, and PNQ
family, and the other 8 were from a second K, P, N, ... PNQ family
which was based on a paraphrase of the first kernel. The
second kernel was derived from the first by synonym substitution.
These sentences were punched and printed on IBM cards in order
to facilitate the administration and analysis of the
multidimensional rank order task.
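
The composition of the stimulus set can be checked with a few
lines of Python. The sketch enumerates the eight members of a
transformation family (the kernel plus every combination of the
passive, negative, and question transformations) and crosses
them with two kernels. The label strings and the names
"kernel 1" and "kernel 2" are placeholders; the actual sentences
are not reproduced here.

```python
from itertools import combinations

TRANSFORMS = ("P", "N", "Q")  # passive, negative, question

def family_labels():
    """Kernel (K) plus every non-empty combination of the three transformations."""
    labels = ["K"]
    for size in range(1, len(TRANSFORMS) + 1):
        labels += ["".join(combo) for combo in combinations(TRANSFORMS, size)]
    return labels

labels = family_labels()
stimuli = [(kernel, label)
           for kernel in ("kernel 1", "kernel 2")
           for label in labels]
print(labels)
print(len(stimuli))
```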

Procedure

Each S was presented with 16 sets of 16 sentences,
each sentence printed on a separate card. The first card
in each set (a pink card) served as the "reference sentence"
for that set. The S's task was to decide which sentence in
each set was most likely to be confused with the "reference
sentence". That sentence was placed immediately behind the
pink card containing the "reference sentence". Next, the S
selected the sentence which was the second most likely
candidate for confusion and placed it behind the first sentence,
and so on. Ss went through all sixteen sets of cards in this
manner without time constraints (average running time was
approximately 40 minutes).
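
The card-sorting task yields, for each subject, one ranking per
reference sentence. A minimal Python sketch of how such rankings
could be turned into a square dissimilarity matrix is given
below. The averaging of the two directed ranks for each pair is
an assumption made for illustration; the report does not say how
the rank-order data were prepared for the scaling program. A
four-item example is used for brevity.

```python
def rank_matrix(rankings):
    """rankings[i] lists the other items from most to least confusable with item i."""
    n = len(rankings)
    ranks = [[0] * n for _ in range(n)]
    for reference, ordered in enumerate(rankings):
        for position, other in enumerate(ordered, start=1):
            ranks[reference][other] = position
    return ranks

def symmetrize(ranks):
    """Average the two directed ranks for every pair (an assumed convention)."""
    n = len(ranks)
    return [[(ranks[i][j] + ranks[j][i]) / 2 for j in range(n)] for i in range(n)]

# Hypothetical rankings from one subject for four sentences (0-3).
rankings = [
    [1, 2, 3],  # with sentence 0 as reference
    [3, 0, 2],  # with sentence 1 as reference
    [3, 0, 1],  # with sentence 2 as reference
    [2, 1, 0],  # with sentence 3 as reference
]
dissimilarity = symmetrize(rank_matrix(rankings))
for row in dissimilarity:
    print(row)
```

Low values mark pairs judged highly confusable; one such matrix
per subject is the kind of input an individual-differences
scaling analysis works from.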

(2) Results and Discussion

The rank-order data were analyzed by the INDSCAL
multidimensional scaling program (Carroll and Chang, 1970).
A four dimensional solution was judged optimal, with the
following dimensions predominating: negative - non-negative,
active - passive, question - non-question, and paraphrase -
non-paraphrase. This solution closely replicates the findings
of Clifton and Odom (1965), who found a relationship between
transformations which is best characterized by Figure 4.

There are some rather striking differences between
good and poor readers in performing the rank-order task.
These differences are clearly shown by a series of pairwise
dimension plots (Figures 5-10). In all graphs the pentagons
represent poor readers, the stars represent good readers,
and the values along the axes represent the weighting of the
dimension in an individual's decision process. A number of
observations can be made: (a) Bayesian and multiple
discriminant analyses lead to highly significant separations of
good and poor readers, thereby indicating substantial differences
in the processing strategies employed by these groups of Ss;
(b) good readers exhibit less inter-subject variability (for
example, they tend to cluster more tightly) than poor readers,
thus indicating strategy communalities among good readers which
do not exist among poorer readers; (c) good readers weight
the question dimension more in their judgments than do poor
readers; and (d) good readers weight the passive dimension less
than poor readers. It should be noted that these differences
in performance cannot be directly attributed to differences
in intelligence, since reading scores and intelligence are
only negligibly correlated.

(3) Conclusion

It can be concluded that good and poor readers do
not perceive syntactic or grammatical structures in the same
way. These results strongly support the findings of Weinstein
and Rabinovitch (1971), and, therefore, imply that some reading
difficulties, even at the college level, may be due to
inappropriate processing of grammatical structures. In particular,
these processing differences may be reflected in differences in
reading flexibility.



C. Experiment 2: The Effect of Reading Ability and Exposure
Time on Judgment of Complexity, Familiarity, and
Comprehensibility

The principal aim of this study was to determine if
"good" and "poor" readers differ in their perception of
sentence difficulty under varying time conditions.
Specifically, the following variables were manipulated or measured:

Sentence difficulty - Eight sets of stimulus
sentences (48 in all) were produced varying in two levels of
(a) syntactic complexity (active vs. passive) (Honeck, 1971),
(b) word familiarity (high vs. low frequency words derived
from the Thorndike-Lorge word frequency list) (Paivio, et al.,
1968), and (c) content meaningfulness (semantically logical
vs. semantically illogical sentences) (see Danks, 1969, for a
similar procedure). Previous work indicates that variations along
these dimensions will contribute to the perceived difficulty
of a sentence (Honeck, 1971; Danks, 1969; Paivio, et al., 1968).
Each sentence contained from nine to eleven words with equal
numbers of nouns and verbs in each. The sentences were
randomly ordered for experimental presentation.

Individual differences - The thirty Ss with the
lowest "total reading" percentile on the Cooperative English
Reading Test were classified as "poor" readers; the thirty Ss
in the highest percentiles were designated "good" readers.
The arbitrary cut-off point for the lowest "good" reader
score was the 75th percentile, whereas the highest "poor"
reader score was at the 56th percentile.

Time considerations - In order to examine possible 
time contingencies which relate to decision-making ability, 
the stimulus sentences were presented under a fast and slow 
time condition. A pre-experimental test suggested that a
1.5 second exposure of each sentence would be roughly
equivalent to a "scan" or "skim" processing mode, whereas a 4
second stimulus exposure would allow for more detailed
processing. It was hypothesized that individual differences
might be more readily detected across these two varying time 
conditions. That is, "poor" readers might misperceive sentence 
difficulty under "fast" conditions, but not under "slow" 
conditions, whereas "good" readers would be accurate under 
both conditions. 
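
The eight sentence sets described above follow from crossing
three two-level factors. The Python sketch below enumerates the
cells of that design. The factor and level names are shorthand
chosen for illustration, and the figure of six sentences per
cell assumes that the 48 sentences were divided equally among
the eight sets, which the text implies but does not state.

```python
from itertools import product

FACTORS = {
    "syntax": ("active", "passive"),
    "word_familiarity": ("high", "low"),
    "meaningfulness": ("logical", "illogical"),
}

def design_cells():
    """Every combination of factor levels, one dictionary per cell."""
    names = list(FACTORS)
    return [dict(zip(names, levels)) for levels in product(*FACTORS.values())]

cells = design_cells()
sentences_per_cell = 48 // len(cells)  # assumes equal cell sizes
print(len(cells), sentences_per_cell)
```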

The major dependent variable in this study was subjective
comprehensibility judgments. A few recent studies have
indicated that comprehensibility judgments correlate quite well
with objective measures of comprehensibility (Schwartz, et al.,
1970). This choice of dependent measure provided us with an
opportunity to investigate another, closely related aspect
of the comprehension process. Specifically, it has been
pointed out by several investigators (for example, Carroll,
1971) that little is known about the subjective dimensions
which underlie these judgments. Therefore, a close
examination of the relation between Ss' judgments of
comprehensibility and the experimental control of the
aforementioned sentence parameters might bring information to
bear on this issue. Furthermore, Ss were also asked to provide
judgments of syntactic complexity and content familiarity.
Relationships among the dependent measures should provide further
evidence as to the subjective basis of comprehensibility judgments.

(1) Method 

Subjects - Sixty Introductory Psychology students at 
Texas Christian University served as subjects. Their partici- 
pation was a course requirement. 

Procedure - There were two parts to this experiment,
corresponding to the previously mentioned time conditions.
Sentences were mounted on slides and presented under mechanical
control on a screen in front of the S. In Part I all Ss, run
in groups of thirty, were given a 1.5 second exposure of each
sentence, followed by a 15 second interval. During this
interval they were to make judgments (on a scale from 1 - 10)
of (a) syntactic complexity, (b) content familiarity, and
(c) comprehensibility of the previous sentence.

Two weeks later, the same Ss were used in Part II.
Part II was identical to Part I with two exceptions: (a) the
exposure time for each sentence was 4 seconds instead of 1.5
seconds, and (b) after each stimulus presentation in Part II,
Ss were asked to judge the veracity (True or False) of a
paraphrase of the previous sentence. This second addition
was included as a check to keep the Ss honest.

(2) Results and Discussion 

At this point it is most informative to subdivide our
data analyses into three categories which indicate their
relationship to: the accuracy of perceiving difficulty,
consistency, and the bases of the comprehensibility judgments.

Accuracy of perceiving difficulty - Two 2-way ANOVAs
(corresponding to the two time conditions) have been carried out.
The first factor was reading ability (good vs. poor), and the
second was the within-subject factor of sentence difficulty
(the eight groups of sentences classified according to
experimental control). With comprehensibility judgments as the
dependent measure, significant results were obtained on
sentence difficulty, but not on reading ability. An
interaction between sentence difficulty and reading ability
occurred in the slower time condition (see Table 1). A
plot of the mean judgment under each level of sentence
difficulty, with these levels clustered and arranged in
approximate order of assumed difficulty, illustrates more
clearly the variation in judgments across cells (see
Figures 11 and 12).

These results suggest that both good and poor
readers seem to be responding to the experimental
manipulation of sentence difficulty, but that the "good" readers
appear to be somewhat more sensitive to these underlying
dimensions, especially "meaningfulness" (note the differences
in Table 2). To provide further evidence for this notion,
a series of Pearson product-moment correlations was
calculated. These correlations were between Ss' comprehensibility
judgments and sentence difficulty. Stimulus sentences were
placed in one of four categories, ranging from easiest to
most difficult, according to the experimental manipulations
of the sentence parameters. Correlations between these "levels
of difficulty" and the comprehensibility estimate of each
sentence for good and poor readers under both time conditions
were calculated (see Table 3). Again, all correlations are
significant at p < .01 (N=1200). Note that good readers have
somewhat higher correlations than poor readers, indicating
a slightly more accurate perception of the underlying parameters.
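
The correlation just described can be sketched in Python as
follows. The rule used to assign a sentence to one of four
difficulty levels (a count of how many of the three parameters
are set to their "hard" value) is an assumption; the text says
only that four categories were formed from the experimental
manipulations. The judgment values are invented for illustration.

```python
import math

def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def difficulty_level(syntax, familiarity, meaningfulness):
    """Assumed rule: number of parameters at their 'hard' value (0 to 3)."""
    hard = ("passive", "low", "illogical")
    return sum(value == h for value, h in zip((syntax, familiarity, meaningfulness), hard))

# Hypothetical data: one difficulty level and one 1-10 judgment per sentence.
levels = [0, 1, 2, 3]
judgments = [9, 7, 4, 2]
r = pearson_r(levels, judgments)
print(round(r, 3))
```

With judgments scaled so that higher values mean greater
comprehensibility, a strongly negative r indicates that the
subject's ratings track the manipulated difficulty.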

Consistency of subjects' judgments - As a means of
measuring Ss' judgment consistencies, each S's judgment on each
sentence in Time 1 was correlated with his judgment on the
same sentence in Time 2. These correlations were calculated
for good and poor readers under all judgment categories (see
Table 4). Of note here is the apparent greater consistency
of good readers across the different time conditions.
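
A per-subject consistency measure of this kind, summarized as
the percentage of subjects whose correlation is significant,
might be computed as in the Python sketch below. The threshold
`r_critical` is a hypothetical parameter standing in for the
critical value of r at the chosen significance level and sample
size; it must be supplied from a table and is not taken from
the report. The judgment data are invented.

```python
import math

def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def percent_consistent(time1, time2, r_critical):
    """Percentage of subjects whose Time 1 / Time 2 correlation exceeds r_critical."""
    rs = [pearson_r(a, b) for a, b in zip(time1, time2)]
    return 100.0 * sum(r > r_critical for r in rs) / len(rs)

# Hypothetical judgments: one row per subject, one column per sentence.
time1 = [[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]
time2 = [[2, 4, 6, 8], [4, 3, 2, 1], [1, 3, 2, 4]]

pct = percent_consistent(time1, time2, r_critical=0.5)
print(round(pct, 1))
```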

Dimensions underlying comprehensibility judgments -
A thorough inspection of Figure 12 (Time 2 ANOVA graph)
indicates that experimental manipulation of the meaningfulness
dimension seems to have the greatest effect on comprehensibility
judgments. Notice that the largest deviation from a best-fitting
straight line occurs when experimental control of
meaningfulness is different from the familiarity and complexity
dimensions (middle of graph). That is, when complexity and
familiarity have "easy" values and meaningfulness has a "hard"
value, the comprehensibility judgment is considerably lower than
the line of best fit; in the opposite control situation, however
(complexity and familiarity are "hard", meaningfulness is
"easy"), comprehensibility judgments are considerably higher
than the line of best fit.

[ANOVA table legend: Factor A = reading ability; Factor B =
sentence difficulty.]

[Figure title: Mean comprehensibility judgments as a function of
sentence difficulty.]

TABLE 4

Percentage of significant consistency correlations
(a consistency correlation was calculated for each subject)

[Table body not fully recoverable. Surviving row labels:
Complexity, Familiarity, Comprehensibility. Surviving values:
61.9, 71.4, 86.9, 66.7.]



In order to provide further evidence for this finding,
a series of Pearson product-moment correlations was calculated
to determine the intercorrelations of the three different
judgment categories (complexity, familiarity, and
comprehensibility). Correlations for good and poor readers in
both time conditions were obtained. Although all correlations
were significant at p < .01 (N=1200), it is important to further
note the large correlations between familiarity and
comprehensibility for both good and poor readers in both time
conditions (see Table 5). One highly plausible interpretation
of this result is that this correlation is mediated by
meaningfulness. That is, Ss perceive content familiarity and
meaningfulness as being synonymous.

(3) Conclusions 

In general it appears that one attribute that
distinguishes "good" from "poor" readers is their ability to
accurately and consistently perceive the difficulty of sentences.
This difference in ability may manifest itself in the "flexibility"
with which the readers deal with material varying in relevance
and difficulty. Experiments to further delimit individual
processing differences in decision-making are in the planning
stage.



An additional finding from this study is the importance
of meaningfulness relative to word familiarity and syntactic
complexity in influencing comprehensibility judgments. This
supports and expands the finding of Danks (1969). Individual
differences in perception of this dimension will be explored
in further studies.